A codebook adaptation algorithm for SCHMM using formant distribution
نویسندگان
چکیده
This paper describes a codebook adaptation process improving the performance of speaker adaptation. The proposed method is performed prior to Bayesian speaker adaptation method using the formant distribution of adaptation data. The reference codebook is adapted to represent the formant distribution of a new speaker. The average recognition rate of Bayesian adaptation is improved from 91.4% to 95.1% using the proposed method. The proposed method is e ective particularly when there exists a large mismatch between the reference codebook and a target speaker in feature space. In this cases the average recognition rate is 95.0% while 89.9% is obtained when only Bayesian adaptation is performed.
منابع مشابه
COMPARISON OF A NEW HYBRID CONNECTIONIST-SCHMM APPROACH WITH OTHER HYBRID APPROACHES FOR SPEECH RECO - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
This paper compares a newly proposed hybrid connectionist-SCHMM approach [5] with other hybrid a p proaches. In the new approach a multilayer perceptron (MLP) replaces the conventional codebooks of semicontinuous HMMs. The MLP is therefore trained on s w d k d basic elements (phones and phone parts) in such a way that the outputs of the network estimate the a posteriori probabilities of these e...
متن کاملTitle On-line adaptation of the SCHMM parameters based on the segmental quasi-bayes learning for speech recognition
In this correspondence, on-line quasi-Bayes adaptation of the mixture coefficients and mean vectors in semicontinuous hidden Markov model (SCHMM) is studied. The viability of the proposed algorithm is confirmed and the related practical issues are addressed in a specific application of on-line speaker adaptation using a 26-word English alphabet vocabulary.
متن کاملAn expectation maximization approach for formant tracking using a parameter-free non-linear predictor
This paper presents a new approach for formant tracking using a parameter-free non-linear predictor that maps formant frequencies and bandwidths into the acoustic feature space. The approach relies on decomposing the speech signal into two components: the first component captures the mapping between formants and acoustic observations, while the second component is intended to capture the residu...
متن کاملOn-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition
In this correspondence, on-line quasi-Bayes adaptation of the mixture coefficients and mean vectors in semicontinuous hidden Markov model (SCHMM) is studied. The viability of the proposed algorithm is confirmed and the related practical issues are addressed in a specific application of on-line speaker adaptation using a 26-word English alphabet vocabulary.
متن کاملPhoneme-Dependent Speech Enhancement
The majority of current speech enhancement systems are based on generalized signal-to-noise ratio dependent weighting rules and do not take into account the characteristics of the actual speech sound being processed. The following contribution is concerned with phoneme-specific speech enhancement methods that apply specially tailored signal processing methods. The first signal processing algori...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996